Overview

Dataset statistics

Number of variables22
Number of observations61
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.6 KiB
Average record size in memory178.1 B

Variable types

NUM17
CAT5

Warnings

Cumulative Production (Oil) (Bbls) (3 Years) is highly correlated with Cumulative Production (Oil) (Bbls) (Current) and 3 other fieldsHigh correlation
Cumulative Production (Oil) (Bbls) (Current) is highly correlated with Cumulative Production (Oil) (Bbls) (3 Years) and 3 other fieldsHigh correlation
Cumulative Production Gas (3 Years) is highly correlated with Cumm Gas (Present)High correlation
Cumm Gas (Present) is highly correlated with Cumulative Production Gas (3 Years)High correlation
Elevation.1 is highly correlated with ElevationHigh correlation
Elevation is highly correlated with Elevation.1High correlation
Months of production is highly correlated with NUMBERHigh correlation
NUMBER is highly correlated with Months of productionHigh correlation
Avg of first year is highly correlated with Cumulative Production (Oil) (Bbls) (Current) and 1 other fieldsHigh correlation
Average Production at 24 is highly correlated with Cumulative Production (Oil) (Bbls) (Current) and 1 other fieldsHigh correlation
Average Production at 36 is highly correlated with Cumulative Production (Oil) (Bbls) (Current) and 1 other fieldsHigh correlation
S/N has unique values Unique
NUMBER has unique values Unique
Well Name has unique values Unique
API has unique values Unique
Cumulative Production (Oil) (Bbls) (Current) has unique values Unique
Cumm Gas (Present) has unique values Unique
Cumulative Production (Oil) (Bbls) (3 Years) has unique values Unique
Cumulative Production Gas (3 Years) has unique values Unique
Cumulative Production (Water) (Bbls) (3 Years) has unique values Unique
Avg of first year has unique values Unique
Average Production at 24 has unique values Unique

Reproduction

Analysis started2020-11-19 01:37:31.255145
Analysis finished2020-11-19 01:38:02.544903
Duration31.29 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

S/N
Real number (ℝ≥0)

UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.06557377
Minimum1
Maximum65
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum1
5-th percentile4
Q116
median31
Q349
95-th percentile62
Maximum65
Range64
Interquartile range (IQR)33

Descriptive statistics

Standard deviation19.14581665
Coefficient of variation (CV)0.597083239
Kurtosis-1.186370971
Mean32.06557377
Median Absolute Deviation (MAD)17
Skewness0.1208090649
Sum1956
Variance366.5622951
MonotocityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
6511.6%
 
3011.6%
 
2811.6%
 
2711.6%
 
2611.6%
 
2511.6%
 
2411.6%
 
2311.6%
 
2211.6%
 
2111.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
111.6%
 
211.6%
 
311.6%
 
411.6%
 
511.6%
 
ValueCountFrequency (%) 
6511.6%
 
6411.6%
 
6311.6%
 
6211.6%
 
6111.6%
 

NUMBER
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19393.90164
Minimum15928
Maximum31306
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum15928
5-th percentile16324
Q116991
median17416
Q321728
95-th percentile26066
Maximum31306
Range15378
Interquartile range (IQR)4737

Descriptive statistics

Standard deviation3588.44076
Coefficient of variation (CV)0.1850293369
Kurtosis0.7835465363
Mean19393.90164
Median Absolute Deviation (MAD)933
Skewness1.262481329
Sum1183028
Variance12876907.09
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1702311.6%
 
1722211.6%
 
2021011.6%
 
1709211.6%
 
1605911.6%
 
1695411.6%
 
2360911.6%
 
1592811.6%
 
2424611.6%
 
1707511.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
1592811.6%
 
1605911.6%
 
1616411.6%
 
1632411.6%
 
1634611.6%
 
ValueCountFrequency (%) 
3130611.6%
 
2642611.6%
 
2640411.6%
 
2606611.6%
 
2554011.6%
 

Well Name
Categorical

UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size488.0 B
BEAR BUTTE 1-4-9H
 
1
A. BLIKRE 12-01H
 
1
BURIAN 1-27H
 
1
20711 MILDRED 94 1-H
 
1
ALICE 44X-34G
 
1
Other values (56)
56 
ValueCountFrequency (%) 
BEAR BUTTE 1-4-9H11.6%
 
A. BLIKRE 12-01H11.6%
 
BURIAN 1-27H11.6%
 
20711 MILDRED 94 1-H11.6%
 
ALICE 44X-34G11.6%
 
CLEO 1-12H11.6%
 
PARSHALL 1-36H11.6%
 
AUSTIN 22-31H11.6%
 
WARBERG 1-25H11.6%
 
BURTMAN 14-23HS11.6%
 
Other values (51)5183.6%
 
Frequencies of value counts

Unique

Unique61 ?
Unique (%)100.0%
Histogram of lengths of the category

Length

Max length32
Median length14
Mean length14.8852459
Min length10

API
Real number (ℝ≥0)

UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3304980157
Minimum3300701525
Maximum3310503202
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum3300701525
5-th percentile3301301414
Q13302501259
median3306100507
Q33306100707
95-th percentile3310501599
Maximum3310503202
Range9801677
Interquartile range (IQR)3599448

Descriptive statistics

Standard deviation2409868.494
Coefficient of variation (CV)0.0007291627725
Kurtosis0.1358255646
Mean3304980157
Median Absolute Deviation (MAD)795559
Skewness0.09762422649
Sum2.016037896e+11
Variance5.807466159e+12
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
330610073111.6%
 
330610066911.6%
 
330610079011.6%
 
330250125911.6%
 
330250104111.6%
 
330230110411.6%
 
330610052411.6%
 
330610231411.6%
 
330610064811.6%
 
330130141411.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
330070152511.6%
 
330070163111.6%
 
330070170711.6%
 
330130141411.6%
 
330130145011.6%
 
ValueCountFrequency (%) 
331050320211.6%
 
331050314311.6%
 
331050225711.6%
 
331050159911.6%
 
330610248811.6%
 

Field
Categorical

Distinct28
Distinct (%)45.9%
Missing0
Missing (%)0.0%
Memory size488.0 B
Parshall
22 
Sanish
St. Demetrius
Little Knife
 
2
Cottonwood
 
2
Other values (23)
24 
ValueCountFrequency (%) 
Parshall2236.1%
 
Sanish813.1%
 
St. Demetrius34.9%
 
Little Knife23.3%
 
Cottonwood23.3%
 
Stoneview23.3%
 
Jim Creek11.6%
 
Epping11.6%
 
Clear Creek11.6%
 
Oakdale11.6%
 
Other values (18)1829.5%
 
Frequencies of value counts

Unique

Unique22 ?
Unique (%)36.1%
Histogram of lengths of the category

Length

Max length14
Median length8
Mean length8.344262295
Min length4

County
Categorical

Distinct7
Distinct (%)11.5%
Missing0
Missing (%)0.0%
Memory size488.0 B
Mountrail
30 
Mckenzie
Dunn
Divide
Burke
Other values (2)
ValueCountFrequency (%) 
Mountrail3049.2%
 
Mckenzie813.1%
 
Dunn711.5%
 
Divide58.2%
 
Burke46.6%
 
Williams46.6%
 
Billings34.9%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length9
Median length8
Mean length7.672131148
Min length4

Company
Categorical

Distinct11
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Memory size488.0 B
EOG Resources
23 
Whiting Oil and Gas Corporation
Continental Resources
XTO Energy Inc
Petro-hunt, LLC
Other values (6)
12 
ValueCountFrequency (%) 
EOG Resources2337.7%
 
Whiting Oil and Gas Corporation914.8%
 
Continental Resources813.1%
 
XTO Energy Inc58.2%
 
Petro-hunt, LLC46.6%
 
Burlington Resources Oil & Gas Company34.9%
 
Oasis Petroleum North America LLC34.9%
 
Hess Bakken Investments II, LLC23.3%
 
Murex Petroleum Corporation23.3%
 
Iron Oil Operating LLC11.6%
 
Frequencies of value counts

Unique

Unique2 ?
Unique (%)3.3%
Histogram of lengths of the category

Length

Max length38
Median length15
Mean length20.44262295
Min length13

Cumulative Production (Oil) (Bbls) (Current)
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean530793.6393
Minimum44658
Maximum1766002
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum44658
5-th percentile82899
Q1193564
median386459
Q3846395
95-th percentile1104077
Maximum1766002
Range1721344
Interquartile range (IQR)652831

Descriptive statistics

Standard deviation411254.7675
Coefficient of variation (CV)0.7747921923
Kurtosis0.4540838709
Mean530793.6393
Median Absolute Deviation (MAD)288012
Skewness0.8834090908
Sum32378412
Variance1.691304838e+11
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
22729111.6%
 
48762211.6%
 
9286511.6%
 
6662011.6%
 
84639511.6%
 
24491311.6%
 
4465811.6%
 
171866811.6%
 
71082911.6%
 
91383011.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
4465811.6%
 
5619611.6%
 
6662011.6%
 
8289911.6%
 
8590111.6%
 
ValueCountFrequency (%) 
176600211.6%
 
171866811.6%
 
115829511.6%
 
110407711.6%
 
108297311.6%
 

Cumm Gas (Present)
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean482305.7705
Minimum27374
Maximum2836874
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum27374
5-th percentile78421
Q1226358
median469079
Q3596950
95-th percentile916598
Maximum2836874
Range2809500
Interquartile range (IQR)370592

Descriptive statistics

Standard deviation422799.0614
Coefficient of variation (CV)0.8766203667
Kurtosis16.44099708
Mean482305.7705
Median Absolute Deviation (MAD)195777
Skewness3.373282062
Sum29420652
Variance1.787590463e+11
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
52249511.6%
 
13178111.6%
 
54572611.6%
 
49465211.6%
 
21932111.6%
 
65516011.6%
 
35832711.6%
 
34168611.6%
 
91659811.6%
 
40279711.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
2737411.6%
 
5733911.6%
 
7393011.6%
 
7842111.6%
 
10662211.6%
 
ValueCountFrequency (%) 
283687411.6%
 
178000211.6%
 
92272211.6%
 
91659811.6%
 
83842511.6%
 

Cumulative Production (Oil) (Bbls) (3 Years)
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean340151.1311
Minimum25805
Maximum1116211
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum25805
5-th percentile41256
Q1123814
median205073
Q3564519
95-th percentile740438
Maximum1116211
Range1090406
Interquartile range (IQR)440705

Descriptive statistics

Standard deviation262564.0862
Coefficient of variation (CV)0.7719041982
Kurtosis-0.4228047722
Mean340151.1311
Median Absolute Deviation (MAD)143931
Skewness0.6852981534
Sum20749219
Variance6.893989934e+10
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
12262011.6%
 
62738411.6%
 
18296611.6%
 
111621111.6%
 
70750411.6%
 
6878311.6%
 
28343811.6%
 
6114211.6%
 
4125611.6%
 
56451911.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
2580511.6%
 
2928611.6%
 
3416311.6%
 
4125611.6%
 
5275711.6%
 
ValueCountFrequency (%) 
111621111.6%
 
85869111.6%
 
77865211.6%
 
74043811.6%
 
70750411.6%
 

Cumulative Production Gas (3 Years)
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean241591.1803
Minimum8054
Maximum1117271
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum8054
5-th percentile41231
Q1105919
median220447
Q3320071
95-th percentile421725
Maximum1117271
Range1109217
Interquartile range (IQR)214152

Descriptive statistics

Standard deviation199971.37
Coefficient of variation (CV)0.8277262844
Kurtosis9.822855465
Mean241591.1803
Median Absolute Deviation (MAD)100311
Skewness2.601901901
Sum14737062
Variance3.998854883e+10
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
29977511.6%
 
10591911.6%
 
32007111.6%
 
20395811.6%
 
13572711.6%
 
38045611.6%
 
38621811.6%
 
3549711.6%
 
33501611.6%
 
39756811.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
805411.6%
 
2609111.6%
 
3549711.6%
 
4123111.6%
 
4214211.6%
 
ValueCountFrequency (%) 
111727111.6%
 
108787811.6%
 
43955711.6%
 
42172511.6%
 
41261511.6%
 
Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68487.86885
Minimum303
Maximum339073
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum303
5-th percentile3071
Q120106
median29924
Q3107660
95-th percentile203137
Maximum339073
Range338770
Interquartile range (IQR)87554

Descriptive statistics

Standard deviation73296.79871
Coefficient of variation (CV)1.070215791
Kurtosis2.11923048
Mean68487.86885
Median Absolute Deviation (MAD)20216
Skewness1.541620358
Sum4177760
Variance5372420701
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
2892711.6%
 
24338011.6%
 
30311.6%
 
17323011.6%
 
3076411.6%
 
1693911.6%
 
11626611.6%
 
13182011.6%
 
7217711.6%
 
16669311.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
30311.6%
 
167411.6%
 
229911.6%
 
307111.6%
 
549711.6%
 
ValueCountFrequency (%) 
33907311.6%
 
24338011.6%
 
20870911.6%
 
20313711.6%
 
19659811.6%
 

No of days produced in 3 years
Real number (ℝ≥0)

Distinct56
Distinct (%)91.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean974.7540984
Minimum721
Maximum1087
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum721
5-th percentile823
Q1946
median1002
Q31017
95-th percentile1050
Maximum1087
Range366
Interquartile range (IQR)71

Descriptive statistics

Standard deviation75.40947238
Coefficient of variation (CV)0.07736255996
Kurtosis1.911891818
Mean974.7540984
Median Absolute Deviation (MAD)31
Skewness-1.431028756
Sum59460
Variance5686.588525
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
101434.9%
 
100423.3%
 
101723.3%
 
101323.3%
 
102111.6%
 
92211.6%
 
105411.6%
 
93011.6%
 
100011.6%
 
80811.6%
 
Other values (46)4675.4%
 
ValueCountFrequency (%) 
72111.6%
 
76911.6%
 
80811.6%
 
82311.6%
 
82811.6%
 
ValueCountFrequency (%) 
108711.6%
 
108611.6%
 
105411.6%
 
105011.6%
 
104211.6%
 

Depth
Real number (ℝ≥0)

Distinct60
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17826.67213
Minimum11325
Maximum22970
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum11325
5-th percentile13689
Q114753
median19215
Q320221
95-th percentile21587
Maximum22970
Range11645
Interquartile range (IQR)5468

Descriptive statistics

Standard deviation3007.469588
Coefficient of variation (CV)0.1687061705
Kurtosis-1.306979829
Mean17826.67213
Median Absolute Deviation (MAD)2003
Skewness-0.3107954129
Sum1087427
Variance9044873.324
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1440023.3%
 
1459111.6%
 
1446211.6%
 
2055111.6%
 
1965311.6%
 
1491411.6%
 
1921511.6%
 
1132511.6%
 
1503511.6%
 
2297011.6%
 
Other values (50)5082.0%
 
ValueCountFrequency (%) 
1132511.6%
 
1211111.6%
 
1328911.6%
 
1368911.6%
 
1410011.6%
 
ValueCountFrequency (%) 
2297011.6%
 
2211511.6%
 
2186311.6%
 
2158711.6%
 
2123011.6%
 

Elevation
Real number (ℝ≥0)

HIGH CORRELATION

Distinct57
Distinct (%)93.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2298.311475
Minimum1912
Maximum2791
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum1912
5-th percentile1999
Q12181
median2303
Q32417
95-th percentile2619
Maximum2791
Range879
Interquartile range (IQR)236

Descriptive statistics

Standard deviation188.9429844
Coefficient of variation (CV)0.08220947702
Kurtosis0.07387274205
Mean2298.311475
Median Absolute Deviation (MAD)114
Skewness0.1025474333
Sum140197
Variance35699.45137
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
236623.3%
 
241723.3%
 
245023.3%
 
211323.3%
 
243111.6%
 
259911.6%
 
236711.6%
 
223711.6%
 
223611.6%
 
261911.6%
 
Other values (47)4777.0%
 
ValueCountFrequency (%) 
191211.6%
 
193211.6%
 
193811.6%
 
199911.6%
 
200511.6%
 
ValueCountFrequency (%) 
279111.6%
 
267511.6%
 
267011.6%
 
261911.6%
 
259911.6%
 

Elevation.1
Real number (ℝ≥0)

HIGH CORRELATION

Distinct59
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2274.655738
Minimum1887
Maximum2765
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum1887
5-th percentile1978
Q12159
median2279
Q32390
95-th percentile2598
Maximum2765
Range878
Interquartile range (IQR)231

Descriptive statistics

Standard deviation189.409247
Coefficient of variation (CV)0.08326941253
Kurtosis0.0587412872
Mean2274.655738
Median Absolute Deviation (MAD)113
Skewness0.09204551051
Sum138754
Variance35875.86284
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
229923.3%
 
197823.3%
 
214611.6%
 
208811.6%
 
224511.6%
 
211611.6%
 
228111.6%
 
249311.6%
 
223711.6%
 
223111.6%
 
Other values (49)4980.3%
 
ValueCountFrequency (%) 
188711.6%
 
190511.6%
 
191311.6%
 
197823.3%
 
200711.6%
 
ValueCountFrequency (%) 
276511.6%
 
265411.6%
 
264511.6%
 
259811.6%
 
257511.6%
 

Start Date
Categorical

Distinct42
Distinct (%)68.9%
Missing0
Missing (%)0.0%
Memory size488.0 B
8-Jun
8-Oct
 
4
8-May
 
3
8-Aug
 
3
7-Jan
 
2
Other values (37)
44 
ValueCountFrequency (%) 
8-Jun58.2%
 
8-Oct46.6%
 
8-May34.9%
 
8-Aug34.9%
 
7-Jan23.3%
 
12-Mar23.3%
 
7-Jun23.3%
 
8-Sep23.3%
 
14-May23.3%
 
13-Jul23.3%
 
Other values (32)3455.7%
 
Frequencies of value counts

Unique

Unique30 ?
Unique (%)49.2%
Histogram of lengths of the category

Length

Max length6
Median length5
Mean length5.393442623
Min length5

Months of production
Real number (ℝ≥0)

HIGH CORRELATION

Distinct45
Distinct (%)73.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean121.3606557
Minimum53
Maximum170
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum53
5-th percentile71
Q196
median136
Q3142
95-th percentile161
Maximum170
Range117
Interquartile range (IQR)46

Descriptive statistics

Standard deviation30.8906851
Coefficient of variation (CV)0.2545362409
Kurtosis-0.9628479746
Mean121.3606557
Median Absolute Deviation (MAD)17
Skewness-0.5242606279
Sum7403
Variance954.2344262
MonotocityNot monotonic
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%) 
14158.2%
 
14246.6%
 
13934.9%
 
7223.3%
 
13723.3%
 
15823.3%
 
13823.3%
 
15323.3%
 
13623.3%
 
13123.3%
 
Other values (35)3557.4%
 
ValueCountFrequency (%) 
5311.6%
 
6211.6%
 
7011.6%
 
7111.6%
 
7223.3%
 
ValueCountFrequency (%) 
17011.6%
 
16611.6%
 
16211.6%
 
16111.6%
 
16011.6%
 

Avg of first year
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14105.19204
Minimum1275
Maximum41402
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum1275
5-th percentile2789
Q15602
median10825
Q322495.5
95-th percentile29270.71429
Maximum41402
Range40127
Interquartile range (IQR)16893.5

Descriptive statistics

Standard deviation10024.99536
Coefficient of variation (CV)0.7107308665
Kurtosis-0.7819906062
Mean14105.19204
Median Absolute Deviation (MAD)7574.83333
Skewness0.5581230356
Sum860416.7143
Variance100500532
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
20906.511.6%
 
453911.6%
 
31112.8333311.6%
 
471811.6%
 
6128.91666711.6%
 
583411.6%
 
291511.6%
 
730511.6%
 
592211.6%
 
31320.5833311.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
127511.6%
 
190311.6%
 
235711.6%
 
278911.6%
 
291511.6%
 
ValueCountFrequency (%) 
4140211.6%
 
31320.5833311.6%
 
31112.8333311.6%
 
29270.7142911.6%
 
28295.3333311.6%
 

Average Production at 24
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7149.57377
Minimum203
Maximum24964
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum203
5-th percentile582
Q12058
median3720
Q312530
95-th percentile17989
Maximum24964
Range24761
Interquartile range (IQR)10472

Descriptive statistics

Standard deviation6483.445993
Coefficient of variation (CV)0.9068297218
Kurtosis-0.2144973716
Mean7149.57377
Median Absolute Deviation (MAD)2824
Skewness0.911396676
Sum436124
Variance42035071.95
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
524511.6%
 
95211.6%
 
696411.6%
 
1445511.6%
 
260511.6%
 
1626611.6%
 
180011.6%
 
708011.6%
 
336511.6%
 
618011.6%
 
Other values (51)5183.6%
 
ValueCountFrequency (%) 
20311.6%
 
21711.6%
 
44111.6%
 
58211.6%
 
88711.6%
 
ValueCountFrequency (%) 
2496411.6%
 
2332411.6%
 
1875111.6%
 
1798911.6%
 
1766411.6%
 

Average Production at 36
Real number (ℝ≥0)

HIGH CORRELATION

Distinct60
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4894.590164
Minimum265
Maximum21390
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum265
5-th percentile641
Q11356
median2902
Q37210
95-th percentile12936
Maximum21390
Range21125
Interquartile range (IQR)5854

Descriptive statistics

Standard deviation4614.706818
Coefficient of variation (CV)0.9428178178
Kurtosis2.241922008
Mean4894.590164
Median Absolute Deviation (MAD)2228
Skewness1.467347352
Sum298570
Variance21295519.01
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
64123.3%
 
119311.6%
 
312611.6%
 
107911.6%
 
222811.6%
 
747511.6%
 
137511.6%
 
107311.6%
 
209611.6%
 
721011.6%
 
Other values (50)5082.0%
 
ValueCountFrequency (%) 
26511.6%
 
49211.6%
 
55411.6%
 
64123.3%
 
64511.6%
 
ValueCountFrequency (%) 
2139011.6%
 
1758511.6%
 
1606711.6%
 
1293611.6%
 
1271211.6%
 

3 years production decline
Real number (ℝ≥0)

Distinct55
Distinct (%)90.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67.47377049
Minimum20.6
Maximum95.8
Zeros0
Zeros (%)0.0%
Memory size488.0 B

Quantile statistics

Minimum20.6
5-th percentile43.9
Q162.3
median69.4
Q376.8
95-th percentile84.4
Maximum95.8
Range75.2
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation14.13784403
Coefficient of variation (CV)0.2095309619
Kurtosis2.40967071
Mean67.47377049
Median Absolute Deviation (MAD)7.4
Skewness-1.184337291
Sum4115.9
Variance199.8786339
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
61.423.3%
 
67.623.3%
 
62.323.3%
 
79.323.3%
 
77.623.3%
 
63.223.3%
 
47.911.6%
 
20.611.6%
 
56.911.6%
 
61.111.6%
 
Other values (45)4573.8%
 
ValueCountFrequency (%) 
20.611.6%
 
21.811.6%
 
40.311.6%
 
43.911.6%
 
45.111.6%
 
ValueCountFrequency (%) 
95.811.6%
 
89.411.6%
 
86.111.6%
 
84.411.6%
 
84.311.6%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

S/NNUMBERWell NameAPIFieldCountyCompanyCumulative Production (Oil) (Bbls) (Current)Cumm Gas (Present)Cumulative Production (Oil) (Bbls) (3 Years)Cumulative Production Gas (3 Years)Cumulative Production (Water) (Bbls) (3 Years)No of days produced in 3 yearsDepthElevationElevation.1Start DateMonths of productionAvg of first yearAverage Production at 24Average Production at 363 years production decline
0116164PARSHALL 1-36H3306100499ParshallMountrailEOG Resources23326512624613432050966303103411325193819136-May1666128.9166672058137577.6
1216324PARSHALL 2-36H3306100503ParshallMountrailEOG Resources38645919780720431875834167499712111204220166-Sep1628043.6666674212312661.1
2316346BARTELSON 1-3H3306100504ParshallMountrailEOG Resources54944831178831732812239310637102614900205420286-Nov16011405.0833308397420063.2
3416370WARBERG 1-25H3306100507ParshallMountrailEOG Resources456873226358274802105919229995513289210620807-Jan15810645.8333306180313770.5
4524281PARSHALL 32-0225H3306102314ParshallMountrailEOG Resources71082950057053602830156563304956195742005197813-Mar8427575.3333308272617977.6
5616483ZACHER 1-24H3306100516ParshallMountrailEOG Resources6247453416863429031298129708105414400216721467-Jun15314145.9166707080458867.6
6716497HOFF 1-10H3306100520ParshallMountrailEOG Resources4876222511832834381040641808089114328199919787-Jun15314432.4166705739280980.5
7825253PARSHALL 35-0509H3306102488ParshallMountrailEOG Resources539028600698396910220447196598808215872113208813-Oct7715893.16667010212546965.6
8916550RALPH 1-32H3306100524ParshallMountrailEOG Resources455154144607252355738364978992214100202720077-Sep15112267.8333305245438164.3
91017912SORENSON 11-3H3306100963SanishMountrailWhiting Oil and Gas Corporation873459655160586996328594296701014202212143211610-Feb12125651.66667013956962062.5

Last rows

S/NNUMBERWell NameAPIFieldCountyCompanyCumulative Production (Oil) (Bbls) (Current)Cumm Gas (Present)Cumulative Production (Oil) (Bbls) (3 Years)Cumulative Production Gas (3 Years)Cumulative Production (Water) (Bbls) (3 Years)No of days produced in 3 yearsDepthElevationElevation.1Start DateMonths of productionAvg of first yearAverage Production at 24Average Production at 363 years production decline
515626426BRAYDEN ALEXANDER FEDERAL 20-17H3302301104Writing RockDivideMurex Petroleum Corporation92865573396209426091339073871186602144212114-May722915.0182764178.0
525731306BURTMAN 14-23HS3302301352BurgDividePetro-hunt, LLC9844710662283665857692433801014191572113208715-Oct533971.02066104473.7
535820814MCGINNITY 3-15H3302300720StoneviewDivideContinental Resources1673883140621230452039581435981017195532335231012-Mar1005834.01930135676.8
545915928ANHELUK 44X-233300701525St. DemetriusBillingsXTO Energy Inc10496215382961142611432992499517591267526546-Jan1703371.058267480.0
556017737ARMSTRONG 1-24H3300701631St. DemetriusBillingsContinental Resources8289973930527574214223701100019627261925989-Apr1312789.0181664576.9
566121806BURIAN 1-27H3300701707St. DemetriusBillingsContinental Resources269265533911205063320071828121042206102670264512-Feb9810825.03365238977.9
576222045ACKLINS 6092 12-18H3301301627CottonwoodBurkeOasis Petroleum North America LLC2052932733021303631028101842941005186502417239212-Mar964718.02808178062.3
586320859CARLSON 159-94-4B-9-1H3301301573North TiogaBurkePetro-hunt, LLC10253414146868783863762087091017192152395237212-Aug923077.01800107964.9
596417777DICK 44X-183301301450StoneviewBurkeXTO Energy Inc5619678421292863549734132102114622245024339-Mar1321275.044149261.4
606517278LUCY 11-23H3301301414CottonwoodBurkeOasis Petroleum North America LLC44658273742580580543622294414462244624288-Sep1381903.021726586.1